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ABSTRACT 



This repbrt discusses the notion df« "language proficiency, 11 M l imited 
language proficiency," and ul tirrtatejy "comparably 1 imi ted (language 
prof iciencyV' ascbhcepts in need of further definition and . 
clarification. Studies critical of the test instruments used for ntaking 
assessments according to these concepts are presented. The author §oes 
on to discuss the specific effect these concepts have had on proposed 
legislation affetting bilintfuaJ education. Then, some scteibl ihguistic 
considerations are suggested for. why test results may be inaccurate 
and/or invalid. Finally, five major points are summarized having a 
bearing on continued research in this area, and four hypotheses are 
proposed concerning the relationship of conventional language 
proficiency testing, actual speech behavior in naturalistic situations ^ 
'and school achievement . , 
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REPORT ON THE STUDY OF LIMITED LANGUAGE PROFICIENCY - ( 
Benji Wald . 

r ' 

Introduct ion ■ ; 

The terrti "semi 1 ingual i sm" (further discussed belowV bas not * 
received wide 'currency in the discussion of American educational 

problems relevant to bilingual education, although the underlying 

_ . * _ _ » # ~, 

concept is much evident in such documents as -the Federal Register undec 
the label "comparably limited, 11 which will be herein referred to as 
"assessed comparably limited, 11 or "alleged comparably limited 1 - 1 
(hereafter ACt) on the basis of the failure of children of this group to 
attain a "proficient 11 status iri any tested language according to one or> 
another of a number of currently competing languages proficiency t 
assessment instruments. 

« 

Both the general issue^of language proficiency assessment and the ^ 
more particular issue of whether the category "comparably limited"^ is 3 
valid one nee'd rigorous study which goes beyond current work. In terms 
of research strategy, the two issues conflate, there is reason .to 
believe that the category "comparably limited" has not been demonstrated 
to be vatiql for* the majority of the children sd-labled;" we hypothesize, 
then, that defects other than actual language proficiency are 
responsible for the resultant assessments, most notably in the 
administration, design, and .under lying assumptions of the tests. Under 
the hypothesis presented here, at ieast part of the responsibility for 
the lack of validity of the cajW^jory "comparably limited," and of 
language proficiency tests in general, Is the lack of a sufficiently 
developed notion of what language proficiency is- and how it Can fee 
measured in a way rele vant to projected school^ achievement. 

Wha t is Language Proficiency? 

Underlying conventional test asse^ment is a cbhcept of language 
proficiency based on the common sense notion that people have different 
degrees of competence (knowledge in the Chomskyan sense) in^various 
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languages, ranging from ''none 11 in most languages to something, 
indeterminate in at least pne languages. This "indeterminate" 
proficiency in a particular language has no upper limit, but it is 
expected that in the overwhelming number of cases' it is based on 
sufficient competence for most purposes, and most importantly for 
receiving meaningful education by means of -that language (although it 
would probably not be appropriate to require writing or critiquing 
Shakespearean drama as a measure for facility in a language). It is the. 
necessity of having the linguistic skills to benefit from U.S. schooling 
that is the reason behind tests of language proficiency. The major 
impetus for their use* has been the 197 2 * Supreme Court Lau decision, 
which precipatejl the need for such test* to facilitate an equitable 
"meaningful education of non-English-proficient students in the public 
eduction system. 

If the notion of ''comparably limited 11 is taken seriously, the 
common sense notion of linguistic competence, i.e., the ability to 
function well in daily verbal interaction, appl ies to only "some 11 
people. It would mean ^t hat mono! inguals have sufficient competence for 
most purposes in one language, while others^ particularly some, . 
bil inguals (with the broadest possible understanding of the term 
M bil ingual 11 ), do not have sufficient competence in any language to 
determine, the language in which they may best receive a meaningful 
education. This will be further discussed below. For' the moment it is 
riecessar? to pursue the common sense notion of language proficiency in 
order to see what It is based on. 

Linguistic competence involves both speaking and comprehension, of 
which only speaking is directly observable for assessment. 
Comprehension is inferred from the effect of language on the behavior of 
a listener. ( In natural situations this is not always possible since, 
for example, comprehension may on*y be evident in the fulfil 1 ing of a. 
directive that 'to take place at a later time, e.g^"drop me a line when 
you get to London 11 ; and, from the point of view of an i nter locutoir, 



comprehension is often taken for granted so that simple back-channeling 
cues like ''lihurih" or ah aperitive nod are interpreted as understanding, 
whereas this may not necessarily be the case.) While it is beyond doubt 
that a certain degree of language proficiency is necessary in order for 
a person to benefit from instruction in that language, it is not obvious 
(low much proficiency is needed or that past a certain point, the more 
proficiency the better the achievement. ■> 

s : • 

What do language proficiency tests measure ? 

tanguage proficiency tests are based oh various assumptions about 
what to measure and how to interpret results^ Quite generally these 
instruments emphasize structural features of performance and some, in 
addition, the following of flon^f unct ional directions. It has been 
pointed out by some observers that there is a lack of match between 
current concepts of communicative competence, which focus on ability to 
function interact ional ly in a language, and test procedures which 
evaluate aspects of linguistic structure divorced from any social 
context (e.g., Galang, 1980). That is, language proficiency develops 
and is used in everyday social contexts, where functional and pragmatic 
features are interwoven with operation of the formal (structural) 
features of language. In contrast* most tests generally denude language 
structure of its normal functions and place speakers- in a highly unusual 
situation whose effects on linguistic behavior are very problemat ic, 
although largely ignored, in usincj the instrument to assess the 

proficiency of the subject. 

f. 

It is far from certain that the most widely used tests of language 
proficiency accurately distinguish those who are proficient in a 
particular language from those who are not. It is possible, as 
hypothesized below, that those who fail to be recorded as proficient 
include a significant number who are in fact proficient; and it is also 
possible that those who are registered as proficient (or M f unct ional ") 
include a number who are actually not or, at least, would benefit more 



i by partial instruction In another language (the home language). (The 
second point above was brought up by Luis Laosa at the New Jprk City 
hearings for the proposed Federal Regulations guiding funding for 
bilingual educat ion in September , 1980.) 

Moreover, the, two possibilities that both proficient and ^_ 
non-proficient-pupils are overestimated by the same instruments needn't 
be contradictory, since the reasons for overestimation may be different 
in both cases. From a strictly linguistic point of view, Dietrich S . 
Freeman (1979 J discuss a large number of examples from tests wfijch would 
tend to underestimate a speaker's 6r$J proficiency, mainly due to 
scoring procedures for syntactic complexity.- However, they also point 
out defects in various tests where faulty assumptions or design may lead 
to overestimation of proficiency, e.g., some tests generalize "overall" 
proficiency from knowledge of textual ly infrequent words (in elementary 
readers circa 1952), some tasks involve selecting a picture from a set 
of dissimilar pictures, which can be done on the basis of knowledge of 
one or two lexical items rather than complete comprehension of the 
syntactic form of the oral instruction. 

/ 

Several studies indicate that the noh-funct ional ity of the usual 
language assessment instruments may result in underestimation of actual 
language proficiency under the hypothesis that situation is a major 
determining factor of language output (cf . Labov, 1972). For eximple, 
work at the Center for Reading has indicated that Black children's 
output of well -formed utterances and number of words and sentences 
increases in describing pictures from the Peabody test when the subject 
is placed at a distance from the tester so that it is no longer obvious 
to the subject that the test can also see the picture. Removing the 
subject and tester from close physical proximity to each other and the 
tester from access to the picture creates an apparent functional reason 
for greater verbal ity on the subject's part (Guthrie S Steffensen, 
1980). 
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Ongoing research at r the Southwest Educational Development 
Laboratory involving a longitudinal study of the oral language 
development of Texas bilingual children (Spanish-English) in which 
kindergarten and f i rst g?ade' chi 1 dren are periodical ly recorded in the 
domains of classroom (to teacher), playground (to peer) and Home (to 
family member), reports that in both quantity and quality displayed 
language proficiency differs systematically such that greatest . 
proficiency is generally displayed in the playground ( Mace-Mat I uck, 
1980, esp. p. 7). Although in that study only structural 
characteristics of language are being investigated, the implications of 
the effect Of function on language are clear. 

Besides the effect of situation on displayed language proficiency, 
there have been several studies which have thrown doubt on the intrinsic 
Validity of some of the test instruments as indicators of language 
proficiency at all or as relevant indicators of scholastic achievement. 
Extensive criticism of the linguistic assumptions of various tests or 
their particular subparts is found in Dietrich and Freeman ( 1979) - 
Rosansky (1978) focuses extensive criticism on the BSM (Bilingual Syntax 
Measure) for Various measures of validity which apply wholly or in part 
to other tests as well. In addition, several normative studies have 
been injurious to the claims made by some test developers for the 
validity of their instruments. For example, a study in the Houston area 
(Gillmore 6 Dickerson, 1979). in which pairs of five language 
proficiency test instruments were compared for equivalence (BSM; 
LAS— Language Assessment Sacles; PAL— Primary Acquisition of Language; 
SPLIT— Schutt Primary Language. Indicator Testj, and BINL— Basic Inventory 
of Natural Language, the last of which had three successive versions for 
calculating raw score and consequent proficiency category on the basis 
of the same data), found that there was a low to moderate correlation at 
best between tt?e classifications obtained from any two instruments, 
perhaps hot surprisingly since the tests disagree as to what should be 
tested to obtain a proficiency measure. Thus, the distinction between 
1 hnited and non-1 imi ted proficiency (in tngllsh) depends on which test 
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is used. In Addition, there was never a high correlation between any -of. 
ttie proficiency tests and any of the achievement subtests used by the 
Texas Educational Agency for exit criteria fitom a bilingual program. 

Indeed, in the case of the BINL* the correlation between language 
proficiency assessment and achievement score was negative in most cases. 

Another study from the Chicago area (Perlman £ Rice, 1979) focused 
oh the LAS exclusively and found that it identified varying proportions 
of monolingual English speaking students with slightly higher than 
average reading scores (according to the city-wide mean) as non- or 
1 Imited-Engl Ish-prbf icient, such that the proportion of students 
assessed as proficient increases with age. This is to be noted, for the 
. LAS claims to be age- independent up to the age of 11, at which point the 
ceiling for proficiency rises and the study again shows ah increase in 
the number of assessed 1 imi ted-jlhgl i sh-prof ic iency children. The LAS 
identified 77.81 of the monolingual English speakers as limited at age 
5. This decreases continual 1y ;to 12.5* at age 11. At age 12, where LAS 
is renormed, limited prof iciehpy again increased to h0% and then 
*/ gradually decreased to 2$% at age 14. 

While the above-cited studies have been concerned only with 
Engl ish,' there is evidence that the same defects apply to other language 
tests, of which, at present, only instrqjnents in Spanish are 
sufficiently developed to be. examined ,in similar ways; One such study 
(Jackson, 1980) from the Texas bilingual children oral language 
development study, indicates that there is a vast lack of agreement 
between LAS as an assessment instrument for Spanish (mean 2.59) arid 
teachers' ratings of the same kindergarten and first gradS children 
(mean 4.17). In thdt study the overestimation of non-proficiency when 
compared to teacher ratings also applies to EhglJsh, al though.at much 
lower level (mean LAS score, 2.10; mean teacher rating, 2.89) than for 
Spanish. Consequently, there*is some reason to be suspicious of the 
validity of a "comparably limited" category. . 

10 
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Assuming that these problems ultimately derive from confusion and_ 
lack of clarity about what language proficiency in fact consists of, the 
most crucial population against which to consider this issue are those- 

children who are assessed to be comparably Wmited in two languages. In 

_ . . . ~ - ~ - . . - * 

the following sections, the language policy and theoretical 

considerations concerning this population are presented. 

/ 

flow Large is the 'Comparably Limited" Population? 

There is a large but imprecisely known group of LEP (limited 
English proficiency) students in the American school system who are 
currently classified as comparably limited (henceforth CL), that is, 
also limited in their primary ^language. Strictly speaking* 
classification as LEP and/or the subcategory CL~is based on achieved 
performance on a language proficiency assessment instrument, of which a 
large number exist, e.g., four are in current use In the California 
school system, BINL, BSM, LAB, LAS, one of which as been selected 
according to the individual school district. The actual implications of 
LEP and CL classification in terms x of actual language proficiency, 
everyday language use in and out of school, effect on school 
achievement, and on teacher's perceptions and expectations of # such 
labeled students, have not been carefu 1 ly studied and are hot known. 
However, these classifications have been of great importance to the 
proposed legislative guidelines controlling minimal requirements for 
bilingual education. Consequently, it Is of extreme importance to 
discover the realities underlylng.these labels^and their effects in the 
school system. 

The size of the assessed comparably limited (ACL) population among 
LEP students is a matter of some controversy. No systematic study has 
been done in this area. Burt and Dulay (1980) report variable numbers 
of ACL depending on district in a study of several school districts in 
Cal ifornia, ranging from a high of $M% of all LEP students sampled in 
Santa Clara (out of a total of 166 chi Idren in grades K-8 using the BSM 
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instrument) to a low of 10* of *1 1 LEP Students sampled {576 in grades 
K-2 using the BSM) in San Diego. Methods of sampl ing have not yet been 
made clear to us. it is di f f icul.t,- therefore* to te11 : how 
representative these data are. However, they follow a pattern such that 
non-prof iciency in Spanish increases as district*; increase insistence 
from the Mexico/0. S. border area, a pattern which is repeated in other 
studies (cf . Laosa, 1977). De Avila and Duncan (1979) report an ACL y 
proportion of 26.51 of their total sample (56 'out of a 191 sample), 
according to the LAS instrument. This they divide intS two categories, 
1) limited bilingual (20.6$ of total), and 2). late language learner 
(5.9* of total), the difference being that the late learners fail to 
score a 3 or above in either language (of English and Spanish). 

The distribution of ACLs differs according to research site. Given 
the four sites over which the total sample was distributed, the Puerto 
Ri can-Americans (at ah unspecified site in the Northeast, but hot New 
York City) had the highest incidence at 45.6fc and the Cuban-Americans 
(at a site in Miami) had the lowest incidence at 9.3$.\ Of the two 
Mexican-American groups studied, the urban group (at a site near San 
Francisco) had an incidence of 29-9%, while the rural group (At a site 
near El Paso and, thus, near the Mexican border) had a lower incidence 
of 15.7%. From their figures one can discern a pattern such that the 
population of late learners at any sit* grows as the number of limited 
bilinguals increases, as if fhe causes underlying both categories are 
the same. For example,. the Puerto Rican group, with the greatest number 
of limited bilinguals, also had the greatest number of 4ate learners 
(15. 6%), wtjile the Cuban group had no late learners- 

In discussion, the authors refer to a previous study, 
(Linguametrics, 1977) indicating that ACL incidence ranged from 20% in 
suburban studies to "as high as 361 in inner city urban schools 11 (op. 
citi p. 28). It is not clear from the report how the sample was 
actually selected, prohibiting an understanding of its 
representativeness to larger populations. in the Texas longitudinal 
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study using the LAS instrument, 76* of. the kindergarten and first 
graders fit the ACL category (as either "limited bilingual s" or "late 
language learners, 11 the latter being the majority). 

In the fol lowing section possible reasons^ for the differential 
distribution of Ait acco4s different social groups can be discussed, 
under the assumption that such figures are to some extent 
representative. Up to this point, the object has been to present 
available evidence that the category is of sufficient size" to merit 
special interest on its own. We will now proceed to examine how this 
sizeable populat i6n is affected by proposed educational policies. 

To underscore the importance of the issue of the reality underlying 
the ACL category as St applies to the U.S. educational policy, it is 
worth citing at length from the Federal. Register of August 5, '198, 
which discusses the proposed ' rules for JSi 1 ingual education services (pp. 
52056-7): (/ 

(a). . .Limited proficiency students who are clearly 

English superior need not be given specially tailored . 

services. These students rely pr imar i ly on EngH sh. 

It is their strongest language. The type of he lp_ 

they may, need in improving thei r Engl ish ski lis is 

similar to that needed by monolingual^ speakers of 

English who h&ve relatively weak language skills. 

The proposed rules, therefore, simply mandates that 

Engl ish-superior students have the same access to 

compensatory help as other students. 

Pertinent to this citation, a comment is necessary. 
Theoretically* this propbsal seems eminently reasonable. In practice, 
however, further specification Is necessary concerning what "clearly 
English-superior" means. Given that a student is tEP to begin with, 
how much lower does his/her score have to be in the primary language in 
order to rate as "clearly English-superior"? Anp, to take up a theme 
which will be further discussed arid exemplified later, how sensitive 
and fine-tuned does the instrument for language proficiency assessment 
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heed to be in order for differences in scores in two languages to be 
significant (i.e., clear). This is far from an academic question since 
In some cases test scores fluctuate greatly from one testing sessfon to 
another, changing assessment of which is the stronger language. 



To continue with the citation: 



(bj. A . The remaining students— those who *re 
comparably [im[ted and those who are clearly 
primary-language super idr--must receive instruction 
designfed to develop full proficiency in English. . . 



A similar objection must be raised to the word "clearly 11 in this 
passage. In |ffect, although we are rfot challenging at this pdirrt the 
provision for^kv%ra! subtypes of CL students, we are insisting that 
careful research is necessary to insure that these distinctions are 
accurately made, and hot oh the basis of instruments that are not 
clearly defined nor equal to the task. 

Id—- ■ ■ - - _ 

The next passage defines the only group of LEP students who will 

unequi vocably get any support for bilingual education: 

(c). . . Because students with weak English skills may fall 
permanently behind in other required subjects whi 1 e they 
are learning EngFtsh, the proposed rules require that 
v primary-language-superior students receive instruction 
through both languages [n required subjects while the 
students are learning EngUsh. Instruction in English may 
be increased as the student's command of the English 
language i ncreases. 

At this point we come to the m&st important segment of the 

citation with regard to the treatnjent of 6b students: 

- / - 
Two alternatives are presented [n the text of the proposed 
rules to stimulate comment on whether comparably \im\te4 
students should also receive this Instruction. These 
alternatives are presented because experts disagree about 
"which placement is best for comparably I imi ted student*. 
Some educators argue strongly that student^-who are 
comparably Hmi ted jn two languages wi 1 1 have a difficult 
time in a monolingual, English-speaking class. They also 
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argue that the skills that 1 imi ted-Erigl i sh-prof iclent 
students possess Tri their own language are not necessari ly 
the same as those possessed in Engl i sh. , For example, 

vocabulary or grammar .skills may be different in each * 

language.. Others object , to this assumption and argue that 
these students will do better in the long run if their 
English skills are sharpened by instruction offered 
exclusively through English. 

For the moment we must suspend discussion of who these experts and 
educators are, and to what texter\t their arguments are based on 
speculative or empir feat considerations. It is clear^from the passage 
that nothing has been resolved about this problem. No research has 
been done with sufficient appeal to legislators to resolve it. 

Until research is done in thus area, a large number of students 
fitting Into the CL category are in danger of being misclassif ied and 
deprived of the meaningful education to which they are entitled under 
Title VI of the Civil Rights Act of 1964 and the Supreme Court Lau 
decision of 197^. 



the following sections de^l with the theoretical considerations 
and initial plan o v f work for research on the problem of CL students. ■ 

What Does "Limite d L angua ge Proficiency 11 Mean ? 

Despite its long history and widespread use, the notion of 
"limited language proficiency" is. unclear and ambiguous, in the 
context of current bi 1 ihgual educat ibh and the Uu, gui del ines, "l imited 
language proficiency" refers to failure on the part of the individual 

_ . --4- 

student to attain a certain cutoff point bh a given language 
proficiency test •instrument. Therefore, one point of contention can 
be— and is— where the cutoff point separating 1 imi ted I from non-1 ftni ted 
language proficiency should be. in addition, given that different 
tests are given in different districts to assess language proficiency, 
to what extent do the cutoff points agree across tests? _ If they do 
not, then a student may be classified as limited on one test and 



non- limited on another. Which test is more accurate? A study of tb ,s 
type for the four tests used in the state of California has been 
commissioned by California's Office of Bi 1 ingual/Bicultural .Educatiol 1 
and should be available shortly. This information will be reviewed 
when available. The Gi 1 lmore arid Dicker son (1979) study in Texas ha* 
already been mentioned above and does not bode well for comparability 
across tests. 

Another question to be raised is, what are the sociol ingui stic 
implications of the classification "limited language proficiency' 1 on ^ 
^tests for actual language proficiency? There are various * 
possibilities, none of which can be dismissed a priori: 

A) Problems inherent in the child. 

B) Problems in test administration. 

C) Problems in test design. 

A) Pj^Ums inherent in the child . This is the possibility tfi** 
the child actually lacks necessary proficiency in the relevantly t£$ tetl 
language. The meaning of the word "relevantly" in the above sentenc^ 
will be clarified when the difference between standard and nonstanda rcl 
varieties of a language is discussed. For the moment we heed drily n^ te 
that the purpose of language proficiency tests is to determine wh*th^ r 
or not the child has sufficient knowledge of the particular language 
tested to receive educational instruction in that language. To .the 
extent that possibilities B) ^rid C) above are responsible for scores 
indicating lack of proficiency, the tests fail to correctiy and 
adequately assess the child's proficiency in the tested language. 
Before discussing these possibilities, we will begin with \f 
considerations under A). , / 



The most serious cases of limited language prof ic iency /appl y to 
monolingual as well as bilingual children. These are the cases in 
which limited language proficiency is due to some biologicjal ly based 
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impairment; *.g. autism, a neurological impairment, some other 

_ - . _ _ - - -\s - ■- - - - -'- - - - — 

physiologically based form of mental retardation or schizophrenic 
condition; in these cases a child will fail to acquire language 
despite a supportive environment and will inevitably test on any 
proficiency instrument as limited and, furthermore, will heed some 
special yfoFrtT^^ for the handicapped rather than necessarily 

bilingual oK intensive English training. Too large a percentage of 
cases of CL proficiency is reported to make this possibility plausible 
*6r Che majority of cases involving bi Unguals (cf. At tanas i et al., 
1978; liurt I Du»lay,y!980b; Skutnabb- Kansas S Toukomaa, 1976; de Avila £ 
Duncan, 1^04_Jlac€^Mat luck, 1980). 

An eyen less plausible" pbssibi I ity is that the CL is of normal 
capability to acquire language but comes from an impoveir i shed 
background in which s/he was given insufficient exposure to any 
language. This idea is embodied in the "verbal deprivation" theory 
popular among educational psychologists in the late 1960s but largely 
discredited due to massive sociol ingui stic and dialectblogical research 
oh nonstandard languages of non-middle class populations (among these 
studies tabov, 1968 and 1972, are classic). The only valid cases where 
"verbal deprivation" is applicable are extremely rare cases of 
antisocial upbringing, such as Genie and other "wol fchi ldren" who fail 
to acquire language because of extreme deprivation caused by virtually 
total isolation from human contact. Such cases are immediately 
recognizable because victims of this treatment fail to acquire all 
other social behaviors in addition to any form of language. Such cases 
are not an issue here, but indicate where a serious form of the "verbal 
deprivation" theory has some validity. 

Sociol ingui stic and psychol inguistic research over the last two 
decades has amply demonstrated that the vast majority of children in 
any linguistic community acquire the language/of that community 
regardless of caregiver practice in overtly teaching the child to speak 
or even speaking to him/her. However, the variety of language spotcen 



in the -cbSnunity may be^quite different from the standard language of 
the schools serving that corSnunity. To the extent that this problem Is 
reflected in low scores bh a language proficiency instrument, the 
problem is one of test design and not the child. This will be further 
discussed rnder the appropriate heading (C). 

While "verbal deprivation 11 has been massively discredited for 
children of the U.S. Black community due to the .research of tabov and 
others, it has been applied in a somewhat rev S sed form to children in 
bilingual communities. The general form of the argument i$ that, in 
some sense, many children who acquire language in a fcj 1 ingual community 
da not get adequate exposure to either language, perhaps because they 
arevunable to separate out which language is which, or because they are 
exposed to intrasentent ial codeswitching and other forms of language 
mixing in which two languages are inextricably mixed in the same 
context. Some scholars p^int to such codeswi tchi ng as the result of^ 
the inability to control either language adequately (cf. Gonzales* 
1977). This matter needs more exploration* since scholars are not at 
alt clear on what codeswi tchi ng is or on how many types there are* and 
imply that. there may be inherent disadvantages to acquiring language in 
a bilingual community, a claim which would be patently false if applied 
to any particular monolingual community. One piece of contradictory 
evidence is Pop lack, 1979, who presents extensive evidence for a Puerto 
Rtcan btl ingual community in New York City that, contrary to the 
"codeswitching as limited proficiency in either language" theory, 
codeswitching in that community is a style of speech sensitive to 
social situation where intrasentential codeswitching is typical of 
speakers highly skilled in both languages rather than of those who are 
limited in one or both languages. ► 

Given what is currently known about language and its acquisition, 
it is highly unl ikely that the normal child (of bilingual as well as 
monolingual background) Is limited in all of the languages s/he has 
been exposed to. -If s/he is limited in one language, it is likely that 

is,* 
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s/hfc is relevantly proficient in another. Here "relevantly proficient 11 
means proficient for age-level in the ianguage(s) used as the primary 
vehicie(s) of communication in the particular community. This language 
(or these languages) may be fionstaraiard , i.e., not recognized as 
Independent languages (as opposed to dialects of a language) by the 
institutions serving 'the community (e^g., the school system). It may 
not" be written. It majy even not be overtly valued by adult users 
themselves due to insecurity caused by pressures to conform to official 
norms. However, in terms of language acquisition, there is no evKdence 
that nonstandard language is any different from a standard one. It is 
totally false' to claim that someone who knows a nonstandard language -' v 
but not a standard one does not have proficiency in any language. \ 
Since it is unlikely that the large number of children classified as CL 
are less normal in language acquisition ability than their monolingual 
peers and do not have proficiency in any language, it is unlikely for 
most of these children that the category CL is a valid' one. 

We point but here that the category CL is a conceptual offspring 
of the term "semi 1 ingual" popularized by Skutnabb-Kangas and Toukomaa 
(1975, 1976) in their studies of the educational achievements of the 
children of Finnish migrant workers in Sweden and still widely used by 
Scholars in Europe arid Asia. The authors define a semi lingual as M a 
person who does not know an^ language properly. 11 It may be the case 
that "properly 11 means "standardly'' in this context, But this is not 
clear from the exchange between the authors (1979) and Brerit-Palmer 
?0979)* In any event, popularization of the\concept, although not the 
term, is evident in Burt and Du lay's report (\l980) on their findings 
for Hispanic children in selected districts in California. It is 
ironic that whereas Skutnabb-Kangas arid Toukomaa used their findings to 
support home language instruction programs through the early grades, 
Burt and Dulay imply that Hispanic CLs should be treated as t.1 English 
speakers (although given the baa procedures we know their home language 
Is reported as Spanish). While it may be the case that\some of the 
Hispanics diagnosed as limited proficiency in English and formerly 
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assumed to be fluent in Spanish are in fadt English-superior, and not 
in need of bilingual education any more than monolingual LEPs, the CL 
category cuts both ways. It is just as possible that a particular CL 
is Spanish-superior even though she/he tested as limited in Spanish, \^ 
and that she/he would benefit more from bilingual education than from 
English immersion. 

B) Problems in test administration . The situation in which a 
test for language proficiency is administered presents if number of 
areas in which problems which interfere with, accurate identif icatiqn of 
language proficiency can be located. Two problem areas immediately 
suggest themselves, of which the first two present the most serious 
challenges to solution: * 

1. Rest stance/unf ami 1 iarity on the part of the child. 

2. Mechanical and other errors on the part of the tester. 

Problem area #1 : Res i stahce/urifam44^a^j tv on-the-pari of the 
chi Id. Much sociol inguistic and ethnol Ingui st Ic research has 
demonstrated the important effect that social situation has on speech, 
not only in the kind of language which is produced oh different 
occasions but also oh the volume of language. With particular 
reference to the "verbal deprivation 11 issue, Labov (1969) demonstrated 
that change in situation can have a decisive effect on whether a child 
may exhibit monosyl labile, limited proficient behavior, or fluent, 
syntactically rich proficient behavior. Children confronted with a 
situation which may be interpreted as threatening by them e.g., a 
formal test situation in school, may adopt the strategy of saying as 
little as possible in order to protect themselves from imagined 
\self-incriminat ion. To the extent that fluency measured in words (or 
Content morphemes) per sentence is used to judge 1 anguage /prof i c i ency , 



as li is, for example* on the BINL (Basic Inventory of Natur.al 
Language), underestimation of language proficiency may be the result of 
such defensive behavior. It is expected that a review of the 
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literature oh test anxiety for bi Unguals and other minority 
populations not accustomed to the types of testing situations and 
Intentions in these situations wi 1 1* farther elucidate this problem* (cf . , 
Kenneth H • 1 1 ^ Institute for Child Development at U. of Illinois). In 
attempting to solve this problem, information is needed on the 
situational contexts that maximize speech for such children. 

Problem area #2: Mechanical and other errors on the part of the 
te ster . Mechanical errors include unreliable transcription, coding and 
calculation of scores by the tester. It is not clear how much error of 
this type there is. However, an examination of test instruments like 
the BINL arid BSM (Bilingual Syntax Measure) reveal 'ample opportunity 
for such error. Those who have worked with transcription of actual 
speech production, which the above-mentioned instruments use as data, 
are aware of the difficulties and painstaking efforts required for 
accurate transcription. A large source of error may be expected in 
transcribing* what Is actual ly sa i d. Another source of^error can be 
found in the next stage of coding (assigning a numerical value to) what 

said. In some cases this involves more than a simple count; the 
coder must recognize certain syntactic structures which are to be a 
assigned a point value. Analysis of syntactic structures from speech 
Is often problematic to linguists and would be of much greater 
difficulty to non-1 inguists. Finally, actual calculation of language 
proficiency scores from completed tests is ^>dint for possible error, 
including gross error, and no doubt also a source of irritation to many 
reluctant testers. In point of fact, the dimension^ of this problem 
hive been such that ChecPoint Systems, Inc., the producers of the BINL 
test, have initiated a program of computerized calculation of language 
proficiency scores from transcription in order to accomodate i!ie 
^complaints of consumers of the test. Since there will still be some 
margin of error according to this process, and syntactic structures 
will have to be parsed for point assignment in calculating the scores, 
we are interested In finding out more about how this is being done. 
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Other sources of error, which anticipate the fbl lowing section oh 
test design, include misunderstanding of nonstandard vernacular .Spanish 
responses by testers trained only in standard literary Spanish, 
resulting in dismissing a nonstandard response and requesting another 
response, or in inaccurate coding of the given response. 

C) Prob 1 ems In test design. Since the purpose of the language 
proficiency test instruments is to test language proficiency, defects 
In the design of the instrument will result in inaccurate conclusions. 
A general definition of language proficiency is a problem. To begin 
with, it is not at all clear that all forms of language proficiency are 
testable under formal conditions, since the normal everyday use of 
language occurs under conditions In which attention is hot focused on^ 
language. Furthermore, in everyday situations, language is not being 
overtly tested but Is being used to facilitate communication and social 
interaction. The recognition that only a small proportion of the 
language proficiencies a person may use in everyday life are directly 
testable may prevent the gross misunderstanding of the results of 
language proficiency testing. A narrow understanding of language 
•proficiency may be sufficient for test purposes. The crucial purpose 
of language proficiency testing is to determine whether or not a child 
has sufficient proficiency in a particular language to be given 
meaningful educational instruction in that language. In addition to 
the question of the validity of the cutoff points used to evaluate^ 
proficiency, already discussed above and subject to study, there are 
three problem areas in test design that can be identified; 

1. Irrelevant or wrong questions or tasks for assessment. 

2. Built-in discriminatory factors against nonstandard 
language features. 

3. Uncontrollable fluctuations in performance on the part of 
the testee, giving a false impression of stable 

~? competence. 

2Z 
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Problem area #1: — t^elevant^or-wronq questions or .tasks for 
assessment . To some extent there is overlap between 1 and 2 above, 
since a factor which biases testing against nonstandard language * 
features can be argued to be ah irrelevant or wrong question. However, 
clearly fitting into this first problem ^rea is the problem of 
selecting which features, out of- the universe of language or 
ianguajge-reiate.d proficiencies, should 6e tested in order to determine 
whether the tested language Is sufficiently developed in the child to, 
be thebest medium of instruction in favoring school achievement. 
There is continuing controversy in the field of language proficiency 
testing concerning cbmmensurabi 1 i ty between* test results on different 
aspects of language prof iciency, . e.g. , phonological and -grammatical 
knowledge. For example, the issue has been raised whether there is an 
interrelationship between results of discrete-point tests (particular 
linguistic features) and integrative tests (a battery of different 
linguistic proficiencies) (cf. Farhady, 1979). 

Perhaps the most radical pojnt of view is reflected in the work of 
Oiler and Perkins (1980), who claim that all forms of language 
proficiency and a number of other proficiencies are commensurate. In a 
series of papers, Cummins (see, e.g., 1979, 1979b, 1980, 1990b) has 
contested this view and distinguished two types of language 
proficiency, one of which is related to scholastic success and the 
other of which is not. Cummins claims that most forms of language 
proficiency are irrelevant to school achievement. BasicaHy, his 
argument is that almost all children are competent in interpersonal - 
communicative skills, but much fewer children are successful in school. 
This calls into question whether tests for fluency which count such 
things as MUI- (mean length of utterance) —a factor, for examp 1 e , on the 
BINL test — are relevant to ability to use a language for school 
achievement. While Cummins 1 basic proposal seems reasonable, it does 
hot necessarily fol low that success in school is at all related to ar>y 
kind of language proficiency. In personal communication with Cummins 
(August, 1980), this" writer has found that he does not seem adverse to 
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conceding that what he has been calling CALP (Cognitive/Academic 
Language Proficiency) , and claiming is direitly related, to school 
achievement* may not be a 1 anquaqe proficiency at all, but rather 
something else not yei clearly defined. However, continued use of the 
term CALP may result in misunderstanding of just what this feature or 
combination of factors is. In addition, Cummins continually alludes^in 
his work to the observations of Skutnabb-Kangas and Toukomaa^ who in 
their discussion of "semi 1 i ngual s" note that there is no correlation 
between the results of their ''language proficiency" testing of the 
children of Finnish migrant workers in Sweden, which show the children 
to test lower than their monolingual Swedish and Finnish counterparts, 
and the observations of parents, teachers and the children themselves 
that the children are fluent In both languages. 

Skutnabb-Kangas and Toukomas' conclusion that the apparent fluency 
and proficiency of these children is a "linguistic facade" and / 
Superficial" is extremely dangerous* especially in a social science, 
sfrice it dismisses as inappropriate any common sense notions about 
language proficiency available to a majority of observers, including 
most teachers. it leaves the determination of the important aspects of 
"language proficiency" in the hands of a few "experts". For example, 
if taken literally, this notion would invalidate the section of 
California's new Bilingual Education improvement and Reform Act of 1980 
which deals with procedures for investigating possible Ct cases: 

(c) For those pupils identified/as bierig of 1 jmjted English 

proficiency, a further assessment shall be made to - 
determine the pupil's primary language proficiency, 
including speaking, cbmprehens ion, reading, and wri ting 
to the extent assessnMBnt^/instruments are available i . . 

^k I f the assessment" 'WnTf^Teff' pursuant^ to thi s sabdi v i s ion 
indicates that the pupil has no prof icjency In the 
primary language, further assessment of the pupil's 
primary language ski l is including consultation wjth the 
pupil's parents and guardians, the classroom teacher, the 
pupil, or others who are familiar with the pupi l's 
language ability In various environments shall" be 
conducted. . . (State of California 1973-80 A.B. 507 
SEC. 9) 
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While the research on CLs which is proposed in this paper will be 

1 . . _____ 

much concerned with being relevant to how to proceed in the further 
assessment of such cases, it is felt that there are no grounds, despite . 
the theoretical position of Sfcutnabb-Kahgas and Toukomaa, and possibly 
CUmmihs, to preclude the procedures recommended beginningon line 1*» of 
the above cited document. On the contrary, the requirement that such 
further assessment be done wit 1 be invaluable data for further 
elucidating the problem. It is also worth mentioning, as Leap (1980) 
observes, that in the writings of Skutnabb-Kangas and Toukomaa there 
are no examples of ac tua 1 speech behavior of these so-called 
"semi I inguais." We know nothing more about their language 
proficiencies than their collective test scores on certain test 
instruments. (See Dietrich & Freeman, 1979, for extended discussion of 
possible problems in assumptions underlying actual .test items.) 

Problem a^ea 42a BuiU^ln jtscr (minatory factors against 

nonstandard language features . One most potent possibility for 
inadequate test design is linguistic prejudice against nonstandard 
varieties of speech being built into the test instrument (as well as 
into the test situation, as mentioned above under that section). If 
low test results are interpreted as meaning lack of proficiency in ANY 
variety of the tested language (rather than simply in the standard 
•variety), as is often the case for prescript ively inclined educators 
who lack knowledge of nonstandard varieties of a language and/or are 
under the misconception that these are distorted or degenerate versions 
of the standard laguage, then the problem is in the test design. Tests 
which have these features are actually culturally unfair and biased 
because they dismiss language proficiencies which derive from the 
child's cultural background, iri effect dismissing them as worthless. 
For example, the BSM, which dwells on the inflectional morphology in 
English (aqeJ other languages) , gives /points for many inflections which 
are characteristically absent or rafe in the speech of many Black 
monolingual English speakers, e.g., tfrird-person singular and the 
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possessive marker of standard English (*s). Many Black students taking 
the BSM/test would score as LEP, although this would be culturally 
biased assessment of their language proficiency. Similarly, this 
defect applies to Hispanic bi Unguals and appears to be based either or: 
a lack of knowledge or oversight »in the test design. For example, for 
the Spanish BSM the tester is cautioned riot subtract points for 
omission of final in Spanish, since^t Is recognized that final ^£ 
deletion is a widespread dialect feature of many varieties of Spanish. 
Thus, the pronunciation of a word like Jjis the feminine plural article, 
as homophonous with \a may be a pronunciation feature rathar than a % 
sign of failure to have acquired the plural morpheme. However, English 
students are penalized for failure to pronounce P^st tense suffix 
- ed despite copious documentation that when -ed is expected to occur 
immediately after a consonant, as in a word 1 ficed stopped , it is likely 
to be deleted for phonological rather than grammatical reasons in most 
varieties of spoken English. 

There is possibility that any test design may be inadequate 
because it is addressed to the wrong language or variety of a language, 
causing the false impression that the speaker is actually deficient in 
any language. With respect to this, some scholars have raised the 
notion that in some areas a "mixed" language historically deriving frdm 
Spanish arid English may have arisen so that one could not classify the 
language as either Spanish or English. This Is distinct from the # 
phenomenon of code-switching discussed previously, since code-switching 
implies that the two languages are separate at least on some occasions, 
although they may alternate even within the same sentence on others). 
De Avila (1979) following taosa (1975) discusses this possibility 
referring to the mixed language as "Spangl i sh. M It remains very much 
to be seen whether such a language exists as a on if ied code * Very 
little research on the subject exjsts. One of the few studies which 
directly addresses the problem is Huerta ( 1 977) • She studies a ; 
two-year old boy (until 2:10) from a middle-class Background, who Is 
primarily exposed to Spanish/English 
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code-switching from members of his family rather than Spanish and 

English separately. She demonstrates that the boy acquires both 

f- --------- 

. Spanish and English vocabulary and realizes that there are synonymous 

relations between pairs of words, e.g., "orange 11 and "naranja", 

"coffee" and "cafe" etc. She also claims that his syntactic 

development In both languages is comparable to what is known about 

monolingual speakers of either language at the same age. Jn 

concluding, she claims that despite these observations the child shows 

no evidence of recognizing that there are two' languages Involved. It 

remains for further research to establish whether a single unified 

language can evolve from extensive exposure to code-switching and 

whether such a unified language can be maintained in any social 

envi ronment in' the U.S. There is no evidence In Huerta's study to 

decide whether the child is dealing with one language or two from his 

own perspective. As for "Spang 1 i sh, " the term has taken on 

condescending and derogatory connotations for many people and is used 

alternately to mean either code-switching, In which case two languages 

are involved, or the varifety of Spanish spoken in many Mexican-American 

communities, in which case a nonstandard form of Spanish is involved 

with obvious Influence of English historically (but not necessarily on 

the level of the Individual speaker). In either case, from the 

perspective of the standard language, both cocje-swl tching and 

nonstandard Spani s^ tend to be overtly stigmatized and misinterpreted 

by the unwitting as a verbal deficit. * 

In the proposed research it will be Important to study to what 
extent changes of situation change the^prbpbrt ibh of the Spanish and 
English in conversation so that it can be determined whether or not 
Spanish and English act independently of each other and, thus, belong 
to separate language systems for individual users. It is interesting 
to note that in the Texas longitudinal study the alternating use of 
Spanish and English is sensitive to situation for the young school-age 
children, so that alternating language is most favored in the 
playground and least in the classroom. In addition, most switching 
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appears to be either at the sentence level or of single nominal s 
(Identified as "lexical") within the sentence. Since these children 
appear to be much stronger in Spanish than in English* as a whole, arid 
close to the Mexican border, the situation might change considerably at 
an older age and in an area more jJistant from the border, such as tos 
Angeles, New York CTty or Chicago. 

Problem area #3; Uncorttrol fable fluctuations in perfbr marice-oft 

the part of the testee. giving false irefiress4on of s tabXe-Competence. 
Uncontrol lable fluctuations In performance are real factors that 
Interfere with getting ari accurate assessment of a constant or base 
proficiency in any one testing period. Test designers have been aware 
of this problem arid have cautioned testers against submitting children 
to test situations when they are ill or in some other visible way not 
in their "normal condition," Certainly the problem of ascertaining 
when a child Is In the best condition to take a test is a problem of 
test administration arid not test design. However, some fluctuations 
stn_1_Jittie understood may interfere with consistent performance on a 
test from one day to the next and may be inherent in the test design. 



I ssues and-QuestXons 

Considered as a whole, this section has presented a formidable 
array of possible problems in obtaining accurate results from available 
language proficiency assessment instruments, t it is essential that 
research arid analysis be addressed to all of these possibilities, and 
that evaluation be made of which of these problems are the most 
prevalent arid most urgently in need of correction. 

To add substance to this theoretical section a few of the types of 
cases that have come to my attention are here presented. 

MA, 8, a third-grade female born in tos Angeles, has tested as 
non-proficient In both English and Spanish. Does she need special 
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education because of a biologically based Impairment or a serious 
psychological defect? Did she simply refuse to talk? Although her 
home language was reported as Spanish, is there another language in 
which she is proficient? 

JR, 8, a third-grade male born in Mexico, tested as limited in 
English and non-proficient in Spanish. He has been in the LA school 
system since kindergarten. Is he actually more proficient in English 
than in Spanish? 



RV* 10* a fifth-grade male born in Mexico and entering the LA 
school system the previous year. He tested as a non-speaker of 
English* as might expected, but also as a limited speaker of Spanish 
witfi a score of 38.6 on the BINL. What is his Spanish like? What 
happened during the test situation? How did he view it? 

FR, 10, a fifth-grade male born in Lbs Angeles and in the same 
classroom as RV above. He tested as limited in both English and 
Spanish. Which is his stronger language? His Spanish score of 74. 3 is 
considerably higher than his classmate RV's. Is he actually more 
proficient than RV in Spanish or is he simply more familiar with the 
test situation? 

PS, 10, a fifth-grade female born in bos Angeles. In 1978 she 
tested as proficient in English according to the BINL. Inadvertently 
she was tested again in 1980, although there is no further procedure 
for proficiency /testing once a child scpres proficient. In 1980 she 
tested as a 1 i mi ted English speaker. This case was an accident, but 
either she was given someone else's score* initial ly, or for some reason 
she did not perform at capacity on the retest ing. 

There are many other children that can be represented here. One 
thing we want to know is: How many? How many different types are 
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there? What is the reality underlying these test results? What 
language should they receive their primary instruction in? 

Summary 

1. The size and extent of the group labeled Assessed/Al leged 
Comparably Limited (ACL or M semi 1 ingua j "J is not well 
documented. 



2. Legislators and educators are in need of research 
information on ACL pupils in order to formulate an 
equitable educational policy. 



3. The results of language prof !c[ency assessment 
Instruments are open. to quest ion for accuracy and 
validity on a variety of bases, of which the most salient, 
are a) norming ca[ibrat|bn (what population do they rely"" 
on to norm language proficiency instruments?); b) 
equivalency (the extent to which different instruments 
agree in their assessment of any individual or group of 
ch|idren)|c) Intrinsic design (what they measure and how 
they score); and d) administration (effect of situation 
on performance). 

4. The not|on of language proficiency has a common sense 
base but is not clearly developed. / 

5. The actua|_speech behavior of the ACL arid other children 
from bil lingua I and non-Engl i sh speaking communities in 
the U.S. (especial ly between grades 3 and 6, ages 8 
through 12) has not been systematically studied. 

Hypotheses 

1. Except for extremely high scorers on most language 
proficiency instruments, the instruments are not an 
accurate indicator of actual language proficiency or 
actual and potential school achievement. 

2. Natural istic discourse and interactional stud|es of 
actual speech behavior will provide a more accurate 
measure of actual language proficiency and will be more 
useful as a diagnostic tool. 

t_ 

3. The ACt category is largely an arUfqct of inaccurate 
instrumental measurements, it is the result of cultural 
and linguistic bias in testing. 
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School achievement dSes not correspond, in any simple or 
direct way to either tested or actual language 
proficiency. That is, whether or nbtjahguage is also 
involved, there are invariably other factors which are 
partially responsible for school achievement. 
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